A Methodology for Collection Selection in Heterogeneous Contexts
Authors
Abstract
In this paper we show that, in an ideal Distributed Information Retrieval environment, it can be effective to take into account each collection server's ability to return relevant documents when selecting collections. Based on this assumption, we propose a new approach to the collection selection problem. To predict a collection's ability to return relevant documents, we inspect a limited number n of documents retrieved from each collection and analyze the proximity of the search keywords within them. In our experiments, we vary the parameter n of the proposed model to determine the most appropriate number of top documents to inspect. We also evaluate the retrieval effectiveness of our approach and compare it with both the centralized indexing and the CORI approaches [1], [16]. Preliminary results from these experiments, conducted on the WT10g test collection, suggest that the proposed method can achieve appreciable retrieval effectiveness.
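The abstract does not give the scoring formula, so the following is only a minimal sketch of the general idea: inspect the top n documents returned by each collection and score the collection by how close the query keywords appear to one another. The function names (proximity_score, rank_collections), the inverse-distance scoring, and the summation over documents are all assumptions made for illustration and are not taken from the paper.

```python
from itertools import combinations


def proximity_score(tokens, query_terms):
    """Score one document by how close distinct query terms occur.

    Assumed scoring (not from the paper): for every pair of distinct
    query terms present in the document, add 1 / (smallest distance
    between any of their positions).
    """
    positions = {}
    for i, tok in enumerate(tokens):
        if tok in query_terms:
            positions.setdefault(tok, []).append(i)
    score = 0.0
    for a, b in combinations(sorted(positions), 2):
        min_dist = min(abs(i - j) for i in positions[a] for j in positions[b])
        score += 1.0 / min_dist
    return score


def rank_collections(results_by_collection, query, n=10):
    """Rank collections by the summed proximity score of their top-n documents.

    results_by_collection maps a collection name to its ranked list of
    document texts; only the first n documents are inspected.
    """
    query_terms = set(query.lower().split())
    scores = {}
    for name, docs in results_by_collection.items():
        scores[name] = sum(
            proximity_score(doc.lower().split(), query_terms)
            for doc in docs[:n]
        )
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy data: two collections and their top retrieved documents.
results = {
    "A": ["distributed information retrieval systems select collections",
          "weather report"],
    "B": ["information about cooking and later some retrieval of recipes"],
}
ranked = rank_collections(results, "information retrieval", n=2)
print(ranked)  # collection "A" ranks first: its keywords are adjacent
```

In this sketch n plays the role of the parameter varied in the experiments: a larger n inspects more documents per collection at a higher cost, while a smaller n relies on fewer, higher-ranked documents.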
Similar resources
An Integrated Approach for Collection Center Selection in Reverse Logistics
In this paper, a hybrid multi-criteria decision-making (MCDM) method and mixed integer linear programming (MILP) approach is used to evaluate the collectors of returned products along with their ordered quantities. First, the most important criteria for collection centers in the car industry are identified. Then, in order to evaluate these proposed criteria, a hybrid Fuzzy Deci...
Investigating the Role of Types of Homograph Contexts in Determining Similarity Between Documents
Aim: Automatic information retrieval is based on the assumption that texts contain content or structural elements that can be used in word sense disambiguation and thereby improve the effectiveness of the results retrieved. Homographs are among the words requiring sense disambiguation. Depending on their roles and positions in texts, homograph contexts could be divided into different types, wit...
Active Learning and Mapping: A Survey and Conception of a New Stochastic Methodology for High Throughput Materials Discovery
Data mining technology is increasingly employed in new industrial processes, which require automatic analysis of data and related results in order to reach conclusions quickly. However, for some applications, full automation may not be appropriate. Unlike traditional data mining contexts, which deal with voluminous amounts of data, some domains are actually characterized by a scarcity ...
Nanocrystalline MgAl2O4 as a Heterogeneous Nanocatalyst for the Synthesis of 2-Ketomethylquinolines Using Green Design Methodology
In this investigation, a facile and green sonochemical route has been developed for the synthesis of 2-ketomethylquinolines from 2-methylquinolines and several acyl chlorides in the presence of nanocrystalline MgAl2O4 as an efficient heterogeneous catalyst. The combination of the nanocatalyst and the ultrasonic process afforded the corresponding ketomethylquinolines in shorter re...
Journal title:
Volume, issue:
Pages: -
Publication date: 2002